Multiple fundamental frequency estimation based on harmonicity and spectral smoothness

نویسنده

  • Anssi Klapuri
چکیده

A new method for estimating the fundamental frequencies of concurrent musical sounds is described. The method is based on an iterative approach, where the fundamental frequency of the most prominent sound is estimated, the sound is subtracted from the mixture, and the process is repeated for the residual signal. For the estimation stage, an algorithm is proposed which utilizes the frequency relationships of simultaneous spectral components, without assuming ideal harmonicity. For the subtraction stage, the spectral smoothness principle is proposed as an efficient new mechanism in estimating the spectral envelopes of detected sounds. With these techniques, multiple fundamental frequency estimation can be performed quite accurately in a single time frame, without the use of long-term temporal features. The experimental data comprised recorded samples of 30 musical instruments from four different sources. Multiple fundamental frequency estimation was performed for random sound source and pitch combinations. Error rates for mixtures ranging from one to six simultaneous sounds were 1.8%, 3.9%, 6.3%, 9.9%, 14%, and 18%, respectively. In musical interval and chord identification tasks, the algorithm outperformed the average of ten trained musicians. The method works robustly in noise, and is able to handle sounds that exhibit inharmonicities. The inharmonicity factor and spectral envelope of each sound is estimated along with the fundamental frequency.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation Emmanuel Vincent, Nancy Bertin and Roland Badeau

Multiple pitch estimation consists of inferring the fundamental frequencies and the salience of the notes forming a music signal over short time frames. This mid-level representation can be exploited as a front-end for higher-level applications, such as music-to-score transcription or chord detection. One approach is to decompose the short-term magnitude spectrum of the signal into a sum of bas...

متن کامل

Multiple Fundamental Frequency Estimation Based on Spectral Pattern Loudness and Smoothness

Two multiple fundamental frequency estimation systems are presented in this work. In the first one (PI1, PI2), the best fundamental frequency candidates combination is found in a frame-by-frame analysis by applying a set of rules, taking into account the spectral smoothness measure described in this work. The second system (PI3) was used to extract symbolic features for audio genre classificati...

متن کامل

Physical principles driven joint evaluation of multiple f0 hypotheses

This article is concerned with the estimation of fundamental frequencies in polyphonic signals for the case when the number of sources is known. We propose a new method for joint evaluation of multiple F0 hypotheses based on three physical principles: harmonicity, spectral smoothness and synchronous amplitude evolution within a single source, which are closely related to source segregation in a...

متن کامل

A New Score Function for Joint Evaluation of Multiple F0 Hypotheses

This article is concerned with the estimation of the fundamental frequencies of the quasiharmonic sources in polyphonic signals for the case that the number of sources is known. We propose a new method for jointly evaluating multiple F0 hypotheses based on three physical principles: harmonicity, spectral smoothness and synchronous amplitude evolution within a single source. Given the observed s...

متن کامل

Multiple Fundamental Frequency Estimation Using Gaussian Smoothness and Short Context

A multiple fundamental frequency estimator is presented in this work. At each time frame, a set of fundamental frequencies is found in a frame by frame analysis taking into account the spectral smoothness measure described in [1] and the information contained in adjacent frames.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Trans. Speech and Audio Processing

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2003